Dataset statistics
| Number of variables | 11 |
|---|---|
| Number of observations | 585058 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 53.6 MiB |
| Average record size in memory | 96.0 B |
Variable types
| Numeric | 7 |
|---|---|
| Categorical | 2 |
| Text | 2 |
Alerts
| Alert | Type |
|---|---|
| CRS_ARR_TIME is highly overall correlated with CRS_DEP_TIME | High correlation |
| CRS_DEP_TIME is highly overall correlated with CRS_ARR_TIME | High correlation |
| CRS_ELAPSED_TIME is highly overall correlated with DISTANCE | High correlation |
| DISTANCE is highly overall correlated with CRS_ELAPSED_TIME | High correlation |
| ARR_DELAY has 10416 (1.8%) zeros | Zeros |
Reproduction
| Analysis started | 2024-07-26 20:45:57.464575 |
|---|---|
| Analysis finished | 2024-07-26 20:46:07.170174 |
| Duration | 9.71 seconds |
| Software version | ydata-profiling v4.9.0 |
| Download configuration | config.json |
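The reported duration can be cross-checked against the two timestamps in the table above. The following is a minimal sketch using only the Python standard library; the variable names are illustrative and are not part of the report.

```python
from datetime import datetime

# Timestamps copied from the Reproduction table.
started = datetime.fromisoformat("2024-07-26 20:45:57.464575")
finished = datetime.fromisoformat("2024-07-26 20:46:07.170174")

# Elapsed wall-clock time between start and finish of the analysis.
duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```

Rounded to two decimals, the difference agrees with the Duration row.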
DAY_OF_MONTH
Real number (ℝ)
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16.223844 |
| Minimum | 1 |
| Maximum | 31 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.9 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 9 |
| median | 16 |
| Q3 | 24 |
| 95-th percentile | 30 |
| Maximum | 31 |
| Range | 30 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 8.8801536 |
|---|---|
| Coefficient of variation (CV) | 0.54735199 |
| Kurtosis | -1.1852382 |
| Mean | 16.223844 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | -0.025358995 |
| Sum | 9491890 |
| Variance | 78.857128 |
| Monotonicity | Not monotonic |
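Several of the DAY_OF_MONTH statistics above are derived from one another, so they can be checked for internal consistency. The sketch below assumes the usual definitions (variance as the square of the standard deviation, CV as standard deviation over mean, sum as mean times the number of observations); the variable names are illustrative.

```python
# Values copied from the DAY_OF_MONTH tables and the dataset statistics.
n_observations = 585058
mean = 16.223844
std = 8.8801536
q1, q3 = 9, 24
minimum, maximum = 1, 31

variance = std ** 2                 # reported Variance: 78.857128
cv = std / mean                     # reported CV: 0.54735199
iqr = q3 - q1                       # reported IQR: 15
value_range = maximum - minimum     # reported Range: 30
total = mean * n_observations       # reported Sum: 9491890

print(f"variance={variance:.6f} cv={cv:.8f} iqr={iqr} "
      f"range={value_range} sum={total:.0f}")
```

Small differences in the last digits are expected because the inputs are already rounded in the report.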
Common Values
| Value | Count | Frequency (%) |
| 24 | 20220 | 3.5% |
| 13 | 20193 | 3.5% |
| 31 | 20033 | 3.4% |
| 17 | 20027 | 3.4% |
| 21 | 19995 | 3.4% |
| 20 | 19945 | 3.4% |
| 10 | 19758 | 3.4% |
| 28 | 19694 | 3.4% |
| 27 | 19684 | 3.4% |
| 23 | 19556 | 3.3% |
| Other values (21) | 385953 | 66.0% |
Minimum values
| Value | Count | Frequency (%) |
| 1 | 17503 | 3.0% |
| 2 | 17986 | 3.1% |
| 3 | 17561 | 3.0% |
| 4 | 16169 | 2.8% |
| 5 | 18779 | 3.2% |
| 6 | 19347 | 3.3% |
| 7 | 18974 | 3.2% |
| 8 | 17749 | 3.0% |
| 9 | 17810 | 3.0% |
| 10 | 19758 | 3.4% |
Maximum values
| Value | Count | Frequency (%) |
| 31 | 20033 | 3.4% |
| 30 | 19476 | 3.3% |
| 29 | 17331 | 3.0% |
| 28 | 19694 | 3.4% |
| 27 | 19684 | 3.4% |
| 26 | 19287 | 3.3% |
| 25 | 18414 | 3.1% |
| 24 | 20220 | 3.5% |
| 23 | 19556 | 3.3% |
| 22 | 18089 | 3.1% |
DAY_OF_WEEK
Real number (ℝ)
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.0346496 |
| Minimum | 1 |
| Maximum | 7 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.9 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 4 |
| Q3 | 6 |
| 95-th percentile | 7 |
| Maximum | 7 |
| Range | 6 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.0724925 |
|---|---|
| Coefficient of variation (CV) | 0.51367349 |
| Kurtosis | -1.3146608 |
| Mean | 4.0346496 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -0.048214599 |
| Sum | 2360504 |
| Variance | 4.2952252 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 1 | 97599 | 16.7% |
| 7 | 93141 | 15.9% |
| 6 | 88379 | 15.1% |
| 4 | 79169 | 13.5% |
| 5 | 77947 | 13.3% |
| 3 | 76587 | 13.1% |
| 2 | 72236 | 12.3% |
Minimum values
| Value | Count | Frequency (%) |
| 1 | 97599 | 16.7% |
| 2 | 72236 | 12.3% |
| 3 | 76587 | 13.1% |
| 4 | 79169 | 13.5% |
| 5 | 77947 | 13.3% |
| 6 | 88379 | 15.1% |
| 7 | 93141 | 15.9% |
Maximum values
| Value | Count | Frequency (%) |
| 7 | 93141 | 15.9% |
| 6 | 88379 | 15.1% |
| 5 | 77947 | 13.3% |
| 4 | 79169 | 13.5% |
| 3 | 76587 | 13.1% |
| 2 | 72236 | 12.3% |
| 1 | 97599 | 16.7% |
AIRLINE
Categorical
| Distinct | 15 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 8.9 MiB |
Length
| Max length | 22 |
|---|---|
| Median length | 21 |
| Mean length | 19.827665 |
| Min length | 9 |
Characters and Unicode
| Total characters | 11600334 |
|---|---|
| Distinct characters | 38 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Endeavor Air Inc. |
|---|---|
| 2nd row | Endeavor Air Inc. |
| 3rd row | Endeavor Air Inc. |
| 4th row | Endeavor Air Inc. |
| 5th row | Endeavor Air Inc. |
Common Values
| Value | Count | Frequency (%) |
| Southwest Airlines Co. | 124840 | 21.3% |
| Delta Air Lines Inc. | 88066 | 15.1% |
| American Airlines Inc. | 81377 | 13.9% |
| United Air Lines Inc. | 62271 | 10.6% |
| SkyWest Airlines Inc. | 56262 | 9.6% |
| Republic Airline | 22650 | 3.9% |
| Alaska Airlines Inc. | 22571 | 3.9% |
| JetBlue Airways | 21108 | 3.6% |
| Spirit Air Lines | 20650 | 3.5% |
| Envoy Air | 19348 | 3.3% |
| Other values (5) | 65915 | 11.3% |
Words
| Value | Count | Frequency (%) |
| inc | 364437 | 19.9% |
| airlines | 322483 | 17.6% |
| air | 218817 | 12.0% |
| lines | 170987 | 9.3% |
| southwest | 124840 | 6.8% |
| co | 124840 | 6.8% |
| delta | 88066 | 4.8% |
| american | 81377 | 4.4% |
| united | 62271 | 3.4% |
| skywest | 56262 | 3.1% |
| Other values (12) | 216000 | 11.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 1348947 | 11.6% |
| (space) | 1245322 | 10.7% |
| n | 1093159 | 9.4% |
| e | 1036386 | 8.9% |
| r | 731746 | 6.3% |
| s | 718251 | 6.2% |
| A | 717340 | 6.2% |
| t | 524164 | 4.5% |
| l | 523578 | 4.5% |
| . | 489277 | 4.2% |
| Other values (28) | 3172164 | 27.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7925367 | 68.3% |
| Uppercase Letter | 1940368 | 16.7% |
| Space Separator | 1245322 | 10.7% |
| Other Punctuation | 489277 | 4.2% |
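The category names in the table above match the Unicode general categories (Ll, Lu, Zs, Po). As a rough illustration of how such counts can be produced, the sketch below tallies categories with Python's standard `unicodedata` module. The helper `category_counts` and the `CATEGORY_NAMES` mapping are hypothetical names introduced here, and this is not necessarily how the profiling tool computes them.

```python
import unicodedata
from collections import Counter

# Map the two-letter Unicode general category codes to the labels in the report.
CATEGORY_NAMES = {
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Zs": "Space Separator",
    "Po": "Other Punctuation",
}

def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts

# One airline name taken from the Common Values table.
counts = category_counts(["Delta Air Lines Inc."])
for name, count in counts.most_common():
    print(name, count)
```

Applied to the whole AIRLINE column rather than a single name, the same tally would yield totals of the kind shown in the table.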
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 1348947 | 17.0% |
| n | 1093159 | 13.8% |
| e | 1036386 | 13.1% |
| r | 731746 | 9.2% |
| s | 718251 | 9.1% |
| t | 524164 | 6.6% |
| l | 523578 | 6.6% |
| c | 468464 | 5.9% |
| o | 299587 | 3.8% |
| a | 285241 | 3.6% |
| Other values (11) | 895844 | 11.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 717340 | 37.0% |
| I | 364437 | 18.8% |
| S | 218061 | 11.2% |
| L | 170987 | 8.8% |
| C | 124840 | 6.4% |
| D | 88066 | 4.5% |
| U | 62271 | 3.2% |
| W | 56262 | 2.9% |
| E | 35805 | 1.8% |
| R | 22650 | 1.2% |
| Other values (5) | 79649 | 4.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1245322 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 489277 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9865735 | 85.0% |
| Common | 1734599 | 15.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 1348947 | 13.7% |
| n | 1093159 | 11.1% |
| e | 1036386 | 10.5% |
| r | 731746 | 7.4% |
| s | 718251 | 7.3% |
| A | 717340 | 7.3% |
| t | 524164 | 5.3% |
| l | 523578 | 5.3% |
| c | 468464 | 4.7% |
| I | 364437 | 3.7% |
| Other values (26) | 2339263 | 23.7% |
Common
| Value | Count | Frequency (%) |
| (space) | 1245322 | 71.8% |
| . | 489277 | 28.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11600334 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 1348947 | 11.6% |
| (space) | 1245322 | 10.7% |
| n | 1093159 | 9.4% |
| e | 1036386 | 8.9% |
| r | 731746 | 6.3% |
| s | 718251 | 6.2% |
| A | 717340 | 6.2% |
| t | 524164 | 4.5% |
| l | 523578 | 4.5% |
| . | 489277 | 4.2% |
| Other values (28) | 3172164 | 27.3% |
ORIGIN
Text
| Distinct | 336 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 8.9 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1755174 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | AUS |
|---|---|
| 2nd row | CVG |
| 3rd row | IND |
| 4th row | JFK |
| 5th row | DTW |
Words
| Value | Count | Frequency (%) |
| atl | 29406 | 5.0% |
| dfw | 25688 | 4.4% |
| den | 24914 | 4.3% |
| ord | 21995 | 3.8% |
| lax | 17253 | 2.9% |
| clt | 16565 | 2.8% |
| las | 15561 | 2.7% |
| sea | 15520 | 2.7% |
| phx | 13876 | 2.4% |
| mco | 13315 | 2.3% |
| Other values (326) | 390965 | 66.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 200976 | 11.5% |
| L | 160262 | 9.1% |
| S | 151410 | 8.6% |
| D | 139059 | 7.9% |
| T | 93498 | 5.3% |
| O | 89468 | 5.1% |
| C | 89116 | 5.1% |
| M | 79765 | 4.5% |
| F | 73200 | 4.2% |
| N | 68866 | 3.9% |
| Other values (16) | 609554 | 34.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1755174 | 100.0% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 200976 | 11.5% |
| L | 160262 | 9.1% |
| S | 151410 | 8.6% |
| D | 139059 | 7.9% |
| T | 93498 | 5.3% |
| O | 89468 | 5.1% |
| C | 89116 | 5.1% |
| M | 79765 | 4.5% |
| F | 73200 | 4.2% |
| N | 68866 | 3.9% |
| Other values (16) | 609554 | 34.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1755174 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 200976 | 11.5% |
| L | 160262 | 9.1% |
| S | 151410 | 8.6% |
| D | 139059 | 7.9% |
| T | 93498 | 5.3% |
| O | 89468 | 5.1% |
| C | 89116 | 5.1% |
| M | 79765 | 4.5% |
| F | 73200 | 4.2% |
| N | 68866 | 3.9% |
| Other values (16) | 609554 | 34.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1755174 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 200976 | 11.5% |
| L | 160262 | 9.1% |
| S | 151410 | 8.6% |
| D | 139059 | 7.9% |
| T | 93498 | 5.3% |
| O | 89468 | 5.1% |
| C | 89116 | 5.1% |
| M | 79765 | 4.5% |
| F | 73200 | 4.2% |
| N | 68866 | 3.9% |
| Other values (16) | 609554 | 34.7% |
DEST
Text
| Distinct | 336 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 8.9 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1755174 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | RDU |
|---|---|
| 2nd row | AUS |
| 3rd row | JFK |
| 4th row | DTW |
| 5th row | MBS |
Words
| Value | Count | Frequency (%) |
| atl | 29363 | 5.0% |
| dfw | 25729 | 4.4% |
| den | 24773 | 4.2% |
| ord | 21878 | 3.7% |
| lax | 17289 | 3.0% |
| clt | 16560 | 2.8% |
| las | 15643 | 2.7% |
| sea | 15542 | 2.7% |
| phx | 13879 | 2.4% |
| mco | 13229 | 2.3% |
| Other values (326) | 391173 | 66.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 201174 | 11.5% |
| L | 160358 | 9.1% |
| S | 151733 | 8.6% |
| D | 138834 | 7.9% |
| T | 93582 | 5.3% |
| O | 89270 | 5.1% |
| C | 89042 | 5.1% |
| M | 79663 | 4.5% |
| F | 73182 | 4.2% |
| N | 68847 | 3.9% |
| Other values (16) | 609489 | 34.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1755174 | 100.0% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 201174 | 11.5% |
| L | 160358 | 9.1% |
| S | 151733 | 8.6% |
| D | 138834 | 7.9% |
| T | 93582 | 5.3% |
| O | 89270 | 5.1% |
| C | 89042 | 5.1% |
| M | 79663 | 4.5% |
| F | 73182 | 4.2% |
| N | 68847 | 3.9% |
| Other values (16) | 609489 | 34.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1755174 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 201174 | 11.5% |
| L | 160358 | 9.1% |
| S | 151733 | 8.6% |
| D | 138834 | 7.9% |
| T | 93582 | 5.3% |
| O | 89270 | 5.1% |
| C | 89042 | 5.1% |
| M | 79663 | 4.5% |
| F | 73182 | 4.2% |
| N | 68847 | 3.9% |
| Other values (16) | 609489 | 34.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1755174 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 201174 | 11.5% |
| L | 160358 | 9.1% |
| S | 151733 | 8.6% |
| D | 138834 | 7.9% |
| T | 93582 | 5.3% |
| O | 89270 | 5.1% |
| C | 89042 | 5.1% |
| M | 79663 | 4.5% |
| F | 73182 | 4.2% |
| N | 68847 | 3.9% |
| Other values (16) | 609489 | 34.7% |
CRS_DEP_TIME
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 1230 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1333.6551 |
| Minimum | 1 |
| Maximum | 2359 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.9 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 600 |
| Q1 | 905 |
| median | 1322 |
| Q3 | 1747 |
| 95-th percentile | 2145 |
| Maximum | 2359 |
| Range | 2358 |
| Interquartile range (IQR) | 842 |
Descriptive statistics
| Standard deviation | 503.71329 |
|---|---|
| Coefficient of variation (CV) | 0.37769381 |
| Kurtosis | -1.0778364 |
| Mean | 1333.6551 |
| Median Absolute Deviation (MAD) | 422 |
| Skewness | 0.093413594 |
| Sum | 7.8026561 × 10⁸ |
| Variance | 253727.08 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 600 | 11962 | 2.0% |
| 700 | 8604 | 1.5% |
| 800 | 5149 | 0.9% |
| 630 | 3702 | 0.6% |
| 615 | 3556 | 0.6% |
| 900 | 3549 | 0.6% |
| 1000 | 3353 | 0.6% |
| 730 | 3279 | 0.6% |
| 830 | 3019 | 0.5% |
| 715 | 2847 | 0.5% |
| Other values (1220) | 536038 | 91.6% |
Minimum values
| Value | Count | Frequency (%) |
| 1 | 14 | < 0.1% |
| 4 | 1 | < 0.1% |
| 5 | 2 | < 0.1% |
| 8 | 27 | < 0.1% |
| 9 | 2 | < 0.1% |
| 10 | 38 | < 0.1% |
| 11 | 4 | < 0.1% |
| 12 | 4 | < 0.1% |
| 14 | 7 | < 0.1% |
| 15 | 65 | < 0.1% |
Maximum values
| Value | Count | Frequency (%) |
| 2359 | 999 | 0.2% |
| 2358 | 67 | < 0.1% |
| 2357 | 117 | < 0.1% |
| 2356 | 57 | < 0.1% |
| 2355 | 227 | < 0.1% |
| 2354 | 57 | < 0.1% |
| 2353 | 82 | < 0.1% |
| 2352 | 8 | < 0.1% |
| 2351 | 4 | < 0.1% |
| 2350 | 135 | < 0.1% |
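The range of 1 to 2359 suggests that CRS_DEP_TIME stores clock times as HHMM integers rather than a continuous quantity. The report itself does not state the encoding, so this is an assumption. Under it, statistics such as the mean or standard deviation are easier to interpret after converting to minutes since midnight, as in this minimal sketch (the helper name `hhmm_to_minutes` is hypothetical).

```python
def hhmm_to_minutes(hhmm: int) -> int:
    """Convert an HHMM-encoded clock time (e.g. 1745) to minutes since midnight.

    Assumes the HHMM encoding described above; raises ValueError for values
    that are not valid clock times under that assumption.
    """
    hours, minutes = divmod(hhmm, 100)
    if not (0 <= hours <= 23 and 0 <= minutes <= 59):
        raise ValueError(f"not a valid HHMM time: {hhmm}")
    return hours * 60 + minutes

# Values taken from the CRS_DEP_TIME tables: most common, minimum, maximum.
for value in (600, 1, 2359):
    print(value, hhmm_to_minutes(value))
```

On the raw encoding, 1259 and 1300 are 41 units apart although they differ by one minute, which is one reason to convert before computing distances or correlations.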
CRS_ARR_TIME
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 1305 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1471.9446 |
| Minimum | 1 |
| Maximum | 2359 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.9 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 657 |
| Q1 | 1045 |
| median | 1503 |
| Q3 | 1922 |
| 95-th percentile | 2300 |
| Maximum | 2359 |
| Range | 2358 |
| Interquartile range (IQR) | 877 |
Descriptive statistics
| Standard deviation | 537.72241 |
|---|---|
| Coefficient of variation (CV) | 0.36531429 |
| Kurtosis | -0.47719119 |
| Mean | 1471.9446 |
| Median Absolute Deviation (MAD) | 432 |
| Skewness | -0.303095 |
| Sum | 8.6117299 × 10⁸ |
| Variance | 289145.39 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 2359 | 2973 | 0.5% |
| 1900 | 1834 | 0.3% |
| 2100 | 1795 | 0.3% |
| 1810 | 1736 | 0.3% |
| 1915 | 1721 | 0.3% |
| 1855 | 1711 | 0.3% |
| 1140 | 1695 | 0.3% |
| 2145 | 1691 | 0.3% |
| 905 | 1648 | 0.3% |
| 2000 | 1639 | 0.3% |
| Other values (1295) | 566615 | 96.8% |
Minimum values
| Value | Count | Frequency (%) |
| 1 | 75 | < 0.1% |
| 2 | 152 | < 0.1% |
| 3 | 274 | < 0.1% |
| 4 | 149 | < 0.1% |
| 5 | 742 | 0.1% |
| 6 | 74 | < 0.1% |
| 7 | 49 | < 0.1% |
| 8 | 74 | < 0.1% |
| 9 | 116 | < 0.1% |
| 10 | 581 | 0.1% |
Maximum values
| Value | Count | Frequency (%) |
| 2359 | 2973 | 0.5% |
| 2358 | 797 | 0.1% |
| 2357 | 687 | 0.1% |
| 2356 | 534 | 0.1% |
| 2355 | 1260 | 0.2% |
| 2354 | 489 | 0.1% |
| 2353 | 549 | 0.1% |
| 2352 | 559 | 0.1% |
| 2351 | 279 | < 0.1% |
| 2350 | 935 | 0.2% |
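The Frequency (%) column is each count divided by the number of observations (585058). The sketch below reproduces the labels seen in the rows above. The cutoff for the "< 0.1%" label is an assumption inferred from those rows (279 is shown as "< 0.1%" while 797 is shown as "0.1%"), and `format_frequency` is a hypothetical helper name.

```python
TOTAL_ROWS = 585058  # "Number of observations" from the dataset statistics

def format_frequency(count: int, total: int = TOTAL_ROWS) -> str:
    """Render count/total as a percentage string with one decimal place."""
    share = count / total
    # Assumed rule: a nonzero share that rounds to 0.000 is shown as "< 0.1%".
    if share > 0 and round(share, 3) == 0:
        return "< 0.1%"
    return f"{share:.1%}"

# (value, count) pairs copied from the CRS_ARR_TIME table above.
for value, count in [(2358, 797), (2351, 279), (2350, 935)]:
    print(value, count, format_frequency(count))
```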
CRS_ELAPSED_TIME
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 445 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 145.61639 |
| Minimum | 23 |
| Maximum | 671 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.9 MiB |
Quantile statistics
| Minimum | 23 |
|---|---|
| 5-th percentile | 64 |
| Q1 | 91 |
| median | 128 |
| Q3 | 175 |
| 95-th percentile | 310 |
| Maximum | 671 |
| Range | 648 |
| Interquartile range (IQR) | 84 |
Descriptive statistics
| Standard deviation | 73.367314 |
|---|---|
| Coefficient of variation (CV) | 0.50383966 |
| Kurtosis | 2.0907867 |
| Mean | 145.61639 |
| Median Absolute Deviation (MAD) | 40 |
| Skewness | 1.3606067 |
| Sum | 85194035 |
| Variance | 5382.7628 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 85 | 11728 | 2.0% |
| 90 | 11646 | 2.0% |
| 80 | 10073 | 1.7% |
| 70 | 9312 | 1.6% |
| 110 | 8755 | 1.5% |
| 115 | 8737 | 1.5% |
| 75 | 8590 | 1.5% |
| 95 | 7877 | 1.3% |
| 105 | 7413 | 1.3% |
| 135 | 7377 | 1.3% |
| Other values (435) | 493550 | 84.4% |
Minimum values
| Value | Count | Frequency (%) |
| 23 | 31 | < 0.1% |
| 26 | 30 | < 0.1% |
| 33 | 31 | < 0.1% |
| 34 | 62 | < 0.1% |
| 35 | 113 | < 0.1% |
| 36 | 241 | < 0.1% |
| 37 | 335 | 0.1% |
| 38 | 357 | 0.1% |
| 39 | 420 | 0.1% |
| 40 | 837 | 0.1% |
Maximum values
| Value | Count | Frequency (%) |
| 671 | 7 | < 0.1% |
| 670 | 21 | < 0.1% |
| 666 | 16 | < 0.1% |
| 665 | 1 | < 0.1% |
| 655 | 31 | < 0.1% |
| 645 | 9 | < 0.1% |
| 615 | 5 | < 0.1% |
| 602 | 7 | < 0.1% |
| 600 | 15 | < 0.1% |
| 585 | 52 | < 0.1% |
DISTANCE
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 1476 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 846.28638 |
| Minimum | 31 |
| Maximum | 5095 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.9 MiB |
Quantile statistics
| Minimum | 31 |
|---|---|
| 5-th percentile | 177 |
| Q1 | 402 |
| median | 683 |
| Q3 | 1076 |
| 95-th percentile | 2253 |
| Maximum | 5095 |
| Range | 5064 |
| Interquartile range (IQR) | 674 |
Descriptive statistics
| Standard deviation | 610.84477 |
|---|---|
| Coefficient of variation (CV) | 0.72179441 |
| Kurtosis | 2.3567987 |
| Mean | 846.28638 |
| Median Absolute Deviation (MAD) | 326 |
| Skewness | 1.4233379 |
| Sum | 4.9512662 × 10⁸ |
| Variance | 373131.34 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| 337 | 3273 | 0.6% |
| 399 | 2551 | 0.4% |
| 594 | 2478 | 0.4% |
| 296 | 2383 | 0.4% |
| 404 | 2354 | 0.4% |
| 447 | 2238 | 0.4% |
| 862 | 2231 | 0.4% |
| 100 | 2195 | 0.4% |
| 328 | 2157 | 0.4% |
| 867 | 2136 | 0.4% |
| Other values (1466) | 561062 | 95.9% |
Minimum values
| Value | Count | Frequency (%) |
| 31 | 61 | < 0.1% |
| 41 | 62 | < 0.1% |
| 61 | 51 | < 0.1% |
| 67 | 385 | 0.1% |
| 68 | 90 | < 0.1% |
| 69 | 21 | < 0.1% |
| 70 | 57 | < 0.1% |
| 73 | 741 | 0.1% |
| 74 | 175 | < 0.1% |
| 75 | 415 | 0.1% |
Maximum values
| Value | Count | Frequency (%) |
| 5095 | 44 | < 0.1% |
| 4983 | 106 | < 0.1% |
| 4962 | 18 | < 0.1% |
| 4817 | 10 | < 0.1% |
| 4502 | 62 | < 0.1% |
| 4475 | 44 | < 0.1% |
| 4243 | 62 | < 0.1% |
| 4213 | 10 | < 0.1% |
| 4184 | 44 | < 0.1% |
| 3972 | 62 | < 0.1% |
ARR_DELAY
Real number (ℝ)
ZEROS 
| Distinct | 1358 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16.641825 |
| Minimum | -86 |
| Maximum | 3337 |
| Zeros | 10416 |
| Zeros (%) | 1.8% |
| Negative | 312657 |
| Negative (%) | 53.4% |
| Memory size | 8.9 MiB |
Quantile statistics
| Minimum | -86 |
|---|---|
| 5-th percentile | -24 |
| Q1 | -13 |
| median | -2 |
| Q3 | 20 |
| 95-th percentile | 116 |
| Maximum | 3337 |
| Range | 3423 |
| Interquartile range (IQR) | 33 |
Descriptive statistics
| Standard deviation | 72.101824 |
|---|---|
| Coefficient of variation (CV) | 4.3325671 |
| Kurtosis | 152.87673 |
| Mean | 16.641825 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | 9.0734374 |
| Sum | 9736433 |
| Variance | 5198.673 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
| -10 | 15439 | 2.6% |
| -11 | 15407 | 2.6% |
| -9 | 15156 | 2.6% |
| -12 | 15027 | 2.6% |
| -8 | 14946 | 2.6% |
| -7 | 14693 | 2.5% |
| -13 | 14628 | 2.5% |
| -6 | 14129 | 2.4% |
| -14 | 14046 | 2.4% |
| -15 | 13315 | 2.3% |
| Other values (1348) | 438272 | 74.9% |
Minimum values
| Value | Count | Frequency (%) |
| -86 | 1 | < 0.1% |
| -85 | 1 | < 0.1% |
| -74 | 1 | < 0.1% |
| -73 | 2 | < 0.1% |
| -72 | 1 | < 0.1% |
| -71 | 1 | < 0.1% |
| -69 | 1 | < 0.1% |
| -68 | 2 | < 0.1% |
| -66 | 3 | < 0.1% |
| -65 | 2 | < 0.1% |
Maximum values
| Value | Count | Frequency (%) |
| 3337 | 1 | < 0.1% |
| 2980 | 1 | < 0.1% |
| 2912 | 1 | < 0.1% |
| 2891 | 1 | < 0.1% |
| 2854 | 1 | < 0.1% |
| 2786 | 1 | < 0.1% |
| 2748 | 1 | < 0.1% |
| 2429 | 1 | < 0.1% |
| 2424 | 1 | < 0.1% |
| 2418 | 1 | < 0.1% |
ARR_DEL15
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 8.9 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1755174 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 1.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 417978 | 71.4% |
| 1.0 | 167080 | 28.6% |
Words
| Value | Count | Frequency (%) |
| 0.0 | 417978 | 71.4% |
| 1.0 | 167080 | 28.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1003036 | 57.1% |
| . | 585058 | 33.3% |
| 1 | 167080 | 9.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1170116 | 66.7% |
| Other Punctuation | 585058 | 33.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1003036 | 85.7% |
| 1 | 167080 | 14.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 585058 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1755174 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1003036 | 57.1% |
| . | 585058 | 33.3% |
| 1 | 167080 | 9.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1755174 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1003036 | 57.1% |
| . | 585058 | 33.3% |
| 1 | 167080 | 9.5% |
Correlations
| | AIRLINE | ARR_DEL15 | ARR_DELAY | CRS_ARR_TIME | CRS_DEP_TIME | CRS_ELAPSED_TIME | DAY_OF_MONTH | DAY_OF_WEEK | DISTANCE |
|---|---|---|---|---|---|---|---|---|---|
| AIRLINE | 1.000 | 0.155 | 0.025 | 0.060 | 0.050 | 0.175 | 0.015 | 0.027 | 0.173 |
| ARR_DEL15 | 0.155 | 1.000 | 0.164 | 0.256 | 0.253 | 0.069 | 0.080 | 0.078 | 0.066 |
| ARR_DELAY | 0.025 | 0.164 | 1.000 | 0.223 | 0.255 | 0.024 | 0.032 | 0.064 | 0.039 |
| CRS_ARR_TIME | 0.060 | 0.256 | 0.223 | 1.000 | 0.733 | 0.036 | 0.003 | 0.004 | 0.033 |
| CRS_DEP_TIME | 0.050 | 0.253 | 0.255 | 0.733 | 1.000 | -0.012 | 0.005 | 0.002 | -0.010 |
| CRS_ELAPSED_TIME | 0.175 | 0.069 | 0.024 | 0.036 | -0.012 | 1.000 | -0.004 | 0.016 | 0.984 |
| DAY_OF_MONTH | 0.015 | 0.080 | 0.032 | 0.003 | 0.005 | -0.004 | 1.000 | -0.014 | -0.005 |
| DAY_OF_WEEK | 0.027 | 0.078 | 0.064 | 0.004 | 0.002 | 0.016 | -0.014 | 1.000 | 0.018 |
| DISTANCE | 0.173 | 0.066 | 0.039 | 0.033 | -0.010 | 0.984 | -0.005 | 0.018 | 1.000 |
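The high-correlation alerts in this report can be recovered from the matrix above by scanning the upper triangle for coefficients above a cutoff. The sketch below assumes a cutoff of 0.5, which is an illustrative choice and not a value stated in the report; `high_correlation_pairs` is a hypothetical helper.

```python
from itertools import combinations

names = ["AIRLINE", "ARR_DEL15", "ARR_DELAY", "CRS_ARR_TIME", "CRS_DEP_TIME",
         "CRS_ELAPSED_TIME", "DAY_OF_MONTH", "DAY_OF_WEEK", "DISTANCE"]

# Coefficients copied row by row from the correlation table.
matrix = [
    [1.000, 0.155, 0.025, 0.060, 0.050, 0.175, 0.015, 0.027, 0.173],
    [0.155, 1.000, 0.164, 0.256, 0.253, 0.069, 0.080, 0.078, 0.066],
    [0.025, 0.164, 1.000, 0.223, 0.255, 0.024, 0.032, 0.064, 0.039],
    [0.060, 0.256, 0.223, 1.000, 0.733, 0.036, 0.003, 0.004, 0.033],
    [0.050, 0.253, 0.255, 0.733, 1.000, -0.012, 0.005, 0.002, -0.010],
    [0.175, 0.069, 0.024, 0.036, -0.012, 1.000, -0.004, 0.016, 0.984],
    [0.015, 0.080, 0.032, 0.003, 0.005, -0.004, 1.000, -0.014, -0.005],
    [0.027, 0.078, 0.064, 0.004, 0.002, 0.016, -0.014, 1.000, 0.018],
    [0.173, 0.066, 0.039, 0.033, -0.010, 0.984, -0.005, 0.018, 1.000],
]

THRESHOLD = 0.5  # assumed cutoff, chosen for illustration

def high_correlation_pairs(names, matrix, threshold):
    """Return (name_a, name_b, coefficient) for each pair at or above threshold."""
    pairs = []
    for i, j in combinations(range(len(names)), 2):
        if abs(matrix[i][j]) >= threshold:
            pairs.append((names[i], names[j], matrix[i][j]))
    return pairs

pairs = high_correlation_pairs(names, matrix, THRESHOLD)
for a, b, coefficient in pairs:
    print(f"{a} ~ {b}: {coefficient:.3f}")
```

With this cutoff the scan yields the scheduled departure and arrival times (0.733) and the scheduled elapsed time and distance (0.984), the same two pairs flagged as high correlation. For modelling, the second pair is close enough to redundant that keeping only one of the two columns may be worth testing.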
First rows
| | DAY_OF_MONTH | DAY_OF_WEEK | AIRLINE | ORIGIN | DEST | CRS_DEP_TIME | CRS_ARR_TIME | CRS_ELAPSED_TIME | DISTANCE | ARR_DELAY | ARR_DEL15 |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 6 | Endeavor Air Inc. | AUS | RDU | 1007 | 1412 | 185.0 | 1162.0 | -8.0 | 0.0 |
| 1 | 1 | 6 | Endeavor Air Inc. | CVG | AUS | 745 | 922 | 157.0 | 958.0 | 3.0 | 0.0 |
| 2 | 1 | 6 | Endeavor Air Inc. | IND | JFK | 710 | 928 | 138.0 | 665.0 | 24.0 | 1.0 |
| 3 | 1 | 6 | Endeavor Air Inc. | JFK | DTW | 1645 | 1900 | 135.0 | 509.0 | -7.0 | 0.0 |
| 4 | 1 | 6 | Endeavor Air Inc. | DTW | MBS | 2115 | 2210 | 55.0 | 98.0 | -2.0 | 0.0 |
| 5 | 1 | 6 | Endeavor Air Inc. | LGA | MCI | 1615 | 1836 | 201.0 | 1107.0 | -5.0 | 0.0 |
| 6 | 1 | 6 | Endeavor Air Inc. | ILM | LGA | 1335 | 1525 | 110.0 | 500.0 | -17.0 | 0.0 |
| 7 | 1 | 6 | Endeavor Air Inc. | LGA | ILM | 915 | 1116 | 121.0 | 500.0 | -20.0 | 0.0 |
| 8 | 1 | 6 | Endeavor Air Inc. | PVD | LGA | 700 | 810 | 70.0 | 143.0 | -21.0 | 0.0 |
| 9 | 1 | 6 | Endeavor Air Inc. | RDU | DCA | 600 | 717 | 77.0 | 227.0 | 13.0 | 0.0 |
Last rows
| | DAY_OF_MONTH | DAY_OF_WEEK | AIRLINE | ORIGIN | DEST | CRS_DEP_TIME | CRS_ARR_TIME | CRS_ELAPSED_TIME | DISTANCE | ARR_DELAY | ARR_DEL15 |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 601856 | 31 | 1 | Republic Airline | EWR | BOS | 845 | 1008 | 83.0 | 200.0 | -17.0 | 0.0 |
| 601857 | 31 | 1 | Republic Airline | DCA | LEX | 2140 | 2316 | 96.0 | 414.0 | -24.0 | 0.0 |
| 601858 | 31 | 1 | Republic Airline | LGA | DCA | 1320 | 1455 | 95.0 | 214.0 | -22.0 | 0.0 |
| 601859 | 31 | 1 | Republic Airline | BOS | LGA | 1100 | 1228 | 88.0 | 184.0 | -9.0 | 0.0 |
| 601860 | 31 | 1 | Republic Airline | IND | IAD | 1941 | 2120 | 99.0 | 476.0 | 27.0 | 1.0 |
| 601861 | 31 | 1 | Republic Airline | IAD | IND | 1705 | 1853 | 108.0 | 476.0 | 55.0 | 1.0 |
| 601862 | 31 | 1 | Republic Airline | IAD | SAV | 900 | 1059 | 119.0 | 515.0 | -26.0 | 0.0 |
| 601863 | 31 | 1 | Republic Airline | PWM | IAD | 530 | 715 | 105.0 | 493.0 | 16.0 | 1.0 |
| 601864 | 31 | 1 | Republic Airline | IAD | IND | 2207 | 2352 | 105.0 | 476.0 | 15.0 | 1.0 |
| 601865 | 31 | 1 | Republic Airline | SAV | IAD | 1400 | 1545 | 105.0 | 515.0 | -17.0 | 0.0 |
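In the sample rows, ARR_DEL15 looks like a flag derived from ARR_DELAY: it is 1.0 where the arrival delay is 15 minutes or more and 0.0 otherwise. The report does not state this rule, so the sketch below treats it as an assumption and checks it only against the ten last rows shown above; `is_delayed_15` is a hypothetical helper.

```python
DELAY_THRESHOLD_MINUTES = 15  # assumed cutoff implied by the column name

def is_delayed_15(arr_delay: float) -> float:
    """Return 1.0 if the arrival delay is at least the threshold, else 0.0."""
    return 1.0 if arr_delay >= DELAY_THRESHOLD_MINUTES else 0.0

# (ARR_DELAY, ARR_DEL15) pairs copied from the last-rows sample.
sample = [
    (-17.0, 0.0), (-24.0, 0.0), (-22.0, 0.0), (-9.0, 0.0), (27.0, 1.0),
    (55.0, 1.0), (-26.0, 0.0), (16.0, 1.0), (15.0, 1.0), (-17.0, 0.0),
]

mismatches = [(delay, flag) for delay, flag in sample
              if is_delayed_15(delay) != flag]
print("mismatches:", mismatches)
```

If the rule holds for the full dataset, ARR_DELAY fully determines ARR_DEL15, so using ARR_DELAY as a feature when predicting ARR_DEL15 would leak the target.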